Model Reuse with Bike Rental Station Data
نویسندگان
چکیده
This paper describes the methodology used for ECMLPKDD 2015 Discovery Challenge on Model Reuse with Bike Rental Station Data (MoReBikeS). The challenge was to predict the number of bikes in the new stations three hours in advance. Initially, the data for the first 25 new stations (station 201 to 225) was provided and various prediction methods were utilized on these test stations and the results were updated every week. Then the full test data for the remaining 50 stations (station 226 to 275) was given and the prediction was made using the best method obtained from the small test challenge. Several methods like Ordinary Least Squares, Poisson Regression, and Zero Inflated Poisson Regression were tried. But reusing the linear models learnt from the old stations (station 1 to 200) with lowest mean absolute error proved to be the simple and effective solution.
منابع مشابه
Model Reuse with Bike rental Station data (Preamble)
1 The challenge Adaptive reuse of learnt knowledge is of critical importance in the majority of knowledge-intensive application areas, particularly when the context in which the learnt model operates can be expected to vary from training to deployment. In the MoReBikeS challenge (Model Reuse with Bike Rental Station Data) that we organised as the ECML-PKDD 2015 Discovery Challenge #1, we decide...
متن کاملExploring spatiotemporal features of station-free bike sharing trips: case study of Shenzhen
The emerging station-free bike sharing schemes create large quantities of spatiotemporal data and provide opportunities for urban and transportation studies. These schemes are different from traditional city rental schemes (they have no docking stations, etc.). They have at their core a continuous GPS system and generate large amounts of spatially located data for each bike. This study proposes...
متن کاملGaussian Process Model Re-Use
Consider the situation where we have some pre-trained classification models for bike rental stations (or any other spatially located data). Given a new rental station (deployment context), we imagine that there might be some rental stations that are more similar to this station in terms of the daily usage patterns, whether or not these stations are close by or not. We propose to use a Gaussian ...
متن کاملSong, H., & Flach, P. (2015). Model Reuse with Subgroup Discovery. In Proceedings of the ECML/PKDD 2015 Discovery Challenges: co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD 2015) (CEUR
In this paper we describe a method to reuse models with Model-Based Subgroup Discovery (MBSD), which is a extension of the Subgroup Discovery scheme. The task is to predict the number of bikes at a new rental station 3 hours in advance. Instead of training new models with the limited data from these new stations, our approach first selects a number of pre-trained models from old rental stations...
متن کاملBike-Share Systems: Accessibility and Availability
The cities of Paris, London, Chicago, and New York (among others) have recently launched large-scale bike-share systems to facilitate the use of bicycles for urban commuting. This paper estimates the relationship between aspects of bike-share system design and ridership. Specifically, we estimate the effects on ridership of station accessibility (how far the commuter must walk to reach a statio...
متن کامل